Skip to content

Conversation

yoyolicoris
Copy link
Member

@yoyolicoris yoyolicoris commented Jul 23, 2025

[------------------------------------- TorchLPC -------------------------------------]
                        |  v07, complex    |  thrust, complex |     v07    |  thrust
4 threads: ---------------------------------------------------------------------------
      [8, 16384, 1]     |        256.6     |       274.6      |     252.5  |    273.6 
      [8, 65536, 1]     |        290.1     |       290.7      |     271.7  |    275.1 
      [8, 262144, 1]    |        483.7     |       495.2      |     416.0  |    308.5 
      [32, 16384, 1]    |        282.9     |       287.9      |     263.0  |    274.5 
      [32, 65536, 1]    |        558.2     |       498.2      |     385.8  |    305.6 
      [32, 262144, 1]   |       1909.9     |      2007.3      |    1380.9  |   1018.4 
      [128, 16384, 1]   |        473.9     |       501.4      |     322.2  |    307.3 
      [128, 65536, 1]   |       1654.8     |      2004.9      |     852.7  |   1018.1 
      [128, 262144, 1]  |       7467.1     |      7928.6      |    4992.5  |   3941.5 

Times are in microseconds (us).
  • thrust performs slightly slower than v0.7 with complex numbers.
  • thrust can be slightly faster on long sequences and large batch sizes with real numbers.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant